Processing of noisy speech using partial phase
نویسندگان
چکیده
This paper explores the possibility of processing noisy speech using signal reconstruction algorithrns frorn Fourier Transform (FT) phase and rnagnitude. Algorithrns have been proposed in the literature for signal reconstruction frorn FT phase alone, or, frorn FT rnagnitude with additional inforrnation in the form of 1-bit phase or signal values. More recently, algorithrns have been proposed for signal reconstruction frorn partial phase (phase inforrnation in selected frequency bands) with cornpensating nurober of signal sarnples. In this paper we exarnine application of these techniques for processing noisy speech. In particular, we show that by selectively processing high signal-to-noise ratio(SNR) regions we can reduce the effect of background additive noise significantly.
منابع مشابه
Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions
Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...
متن کاملImproving the performance of MFCC for Persian robust speech recognition
The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper to achieve a satisfactorily performance in Automatic Speech Recognition (ASR) applications we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...
متن کاملWaveform estimation using group delay processing
A method of signal waveform estimation from an ensemble of jittered noisy measurements is presented. The method uses group delay functions to perform the ensemble averaging and thus overcomes the difficulty of computing the unwrapped phase function before averaging. We propose a new technique, called group delay processing, to estimate the signal waveform if only a single noisy measurement is a...
متن کاملSpeech Enhancement Using Gaussian Mixture Models, Explicit Bayesian Estimation and Wiener Filtering
Gaussian Mixture Models (GMMs) of power spectral densities of speech and noise are used with explicit Bayesian estimations in Wiener filtering of noisy speech. No assumption is made on the nature or stationarity of the noise. No voice activity detection (VAD) or any other means is employed to estimate the input SNR. The GMM mean vectors are used to form sets of over-determined system of equatio...
متن کاملDNN-Based Amplitude and Phase Feature Enhancement for Noise Robust Speaker Identification
The importance of the phase information of speech signal is gathering attention. Many researches indicate system combination of the amplitude and phase features is effective for improving speaker recognition performance under noisy environments. On the other hand, speech enhancement approach is taken usually to reduce the influence of noises. However, this approach only enhances the amplitude s...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1987